7 research outputs found

    Smart Agriculture : A Novel Multilevel Approach for Agricultural Risk Assessment over Unstructured Data

    Full text link
    Detecting opportunities and threats from massive text data is a challenging task for most. Traditionally, companies would rely mainly on structured data to detect and predict risks, losing a huge amount of information that could be extracted from unstructured text data. Fortunately, artificial intelligence came to remedy this issue by innovating in data extraction and processing techniques, allowing us to understand and make use of Natural Language data and turning it into structures that a machine can process and extract insight from. Uncertainty refers to a state of not knowing what will happen in the future. This paper aims to leverage natural language processing and machine learning techniques to model uncertainties and evaluate the risk level in each uncertainty cluster using massive text data

    BERT Based Clinical Knowledge Extraction for Biomedical Knowledge Graph Construction and Analysis

    Full text link
    Background : Knowledge is evolving over time, often as a result of new discoveries or changes in the adopted methods of reasoning. Also, new facts or evidence may become available, leading to new understandings of complex phenomena. This is particularly true in the biomedical field, where scientists and physicians are constantly striving to find new methods of diagnosis, treatment and eventually cure. Knowledge Graphs (KGs) offer a real way of organizing and retrieving the massive and growing amount of biomedical knowledge. Objective : We propose an end-to-end approach for knowledge extraction and analysis from biomedical clinical notes using the Bidirectional Encoder Representations from Transformers (BERT) model and Conditional Random Field (CRF) layer. Methods : The approach is based on knowledge graphs, which can effectively process abstract biomedical concepts such as relationships and interactions between medical entities. Besides offering an intuitive way to visualize these concepts, KGs can solve more complex knowledge retrieval problems by simplifying them into simpler representations or by transforming the problems into representations from different perspectives. We created a biomedical Knowledge Graph using using Natural Language Processing models for named entity recognition and relation extraction. The generated biomedical knowledge graphs (KGs) are then used for question answering. Results : The proposed framework can successfully extract relevant structured information with high accuracy (90.7% for Named-entity recognition (NER), 88% for relation extraction (RE)), according to experimental findings based on real-world 505 patient biomedical unstructured clinical notes. Conclusions : In this paper, we propose a novel end-to-end system for the construction of a biomedical knowledge graph from clinical textual using a variation of BERT models

    Comparative Study of CNN and LSTM for Opinion Mining in Long Text

    No full text
    The digital revolution has encouraged many companies to set up new strategic and operational mechanisms to supervise the flow of information published about them on the Web. Press coverage analysis is a part of sentiment analysis that allows companies to discover the opinion of the media concerning their activities, products and services. It is an important research area, since it involves the opinion of informed public such as journalists, who may influence the opinion of their readers. However, from an implementation perspective, the analysis of the opinion from media coverage encounters many challenges. In fact, unlike social networks, the Media coverage is a set of large textual documents written in natural language. The training base being huge, it is necessary to adopt large-scale processing techniques like Deep Learning to analyze their content. To guide researchers to choose between one of the most commonly used models CNN and LSTM, we compare and apply both models for opinion mining from long text documents using real datasets

    Comparative study of CNN and LSTM for opinion mining in long text

    No full text
    The digital revolution has encouraged many companies to set up new strategic and operational mechanisms to supervise the flow of information published about them on the Web. Press coverage analysis is a part of sentiment analysis that allows companies to discover the opinion of the media concerning their activities, products and services. It is an important research area, since it involves the opinion of informed public such as journalists, who may influence the opinion of their readers. However, from an implementation perspective, the analysis of the opinion from media coverage encounters many challenges. In fact, unlike social networks, the Media coverage is a set of large textual documents written in natural language. The training base being huge, it is necessary to adopt large-scale processing techniques like Deep Learning to analyze their content. To guide researchers to choose between one of the most commonly used models CNN and LSTM, we compare and apply both models for opinion mining from long text documents using real datasets

    Unit Load Devices (ULD) Demand Forecasting in the Air Cargo for Optimal Cost Management

    No full text
    In recent decades, the airline industry has become very competitive. With the advent of large aircraft in service, unit load devices (ULD) have become an essential ele‐ ment for efficient air transport. They can load a large amount of baggage, cargo or mail using only one unit. Since this results in fewer units to load, saving time and efforts of ground crews and helping to avoid delayed flig‐ hts. However, a deficient loading of the units causes ope‐ rating irregularities, costing the company and contribu‐ ting to the dissatisfaction of the customers. In contrast, an excess load of containers is at the expense of cargo. In this paper we propose an approach to predict the de‐ mand for baggage in order to optimize the management of its ULD flow. Specifically, we build prediction models: ARIMA following the BOX‐JENKINS approach and expo‐ nential smoothing methods, in order to obtain more accu‐ rate forecasts. The approach is tested using the operatio‐ nal data of flight processing and the results are compared with four benchmark method (SES, DES, Holt‐Winters and Naive prediction) using different performance indicators: MAE, MSE, MAPE , WAPE, RMSE, SMPE. The results obtai‐ ned with the exponential smoothing methods surpass the benchmarks by providing more accurate forecasts

    Unit load devices (ULD) demand forecasting in the air cargo for optimal cost management

    No full text
    In recent decades, the airline industry has become very competitive. With the advent of large aircraft in service, unit load devices (ULD) have become an essential ele‐ ment for efficient air transport. They can load a large amount of baggage, cargo or mail using only one unit. Since this results in fewer units to load, saving time and efforts of ground crews and helping to avoid delayed flig‐ hts. However, a deficient loading of the units causes ope‐ rating irregularities, costing the company and contribu‐ ting to the dissatisfaction of the customers. In contrast, an excess load of containers is at the expense of cargo. In this paper we propose an approach to predict the de‐ mand for baggage in order to optimize the management of its ULD flow. Specifically, we build prediction models: ARIMA following the BOX‐JENKINS approach and expo‐ nential smoothing methods, in order to obtain more accu‐ rate forecasts. The approach is tested using the operatio‐ nal data of flight processing and the results are compared with four benchmark method (SES, DES, Holt‐Winters and Naive prediction) using different performance indicators: MAE, MSE, MAPE , WAPE, RMSE, SMPE. The results obtai‐ ned with the exponential smoothing methods surpass the benchmarks by providing more accurate forecasts

    Towards Efficient Big Data: Hadoop Data Placing and Processing

    No full text
    International audienc
    corecore